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Digital signal processing integrated circuit with IO connections 



The invention relates to digital signal processing and in particular to digital 
signal processing circuits with multiple programmed digital signal processors that operate in 
parallel. 

Digital signal processing circuits are ubiquitous in modem consumer 
5 electronic equipment. Signal processing circuits differ from general computers in that they 
receive and/or output real-time or near real time signal streams, such as audio or video signal 
streams, for reception, transmission, rendering, recording purposes etc. The same processing 
operations are applied over and over again to data from these streams for an indefinite time 
period. Application specific equipment is typically permanently programmed to perform just 

1 0 these processing operations. 

Real time or near real-time signal stream processing is needed, which requires 
a high processing capacity. US patent No. 6,456,628 discloses how multiple circuit boards 
with digital signal processors can be used in parallel to provide high processing capacity. In 
order to fully utilize the processing capacity of parallel processors communication 

15 bottlenecks are counteracted. For this purpose the digital signal processors in this patent 
have, in addition to a shared bus connection, local (single processor to single processor) 
communication connections between selected pairs of digital signal processors. Moreover, 
each digital signal processor has its own external I/O interface outside the shared bus to a 
daughterboard. This is typical for digital signal processing. In general computers the I/O 

20 interfaces are normally connected via the shared bus, but in a digital signal processing system 
this would interfere with the real time nature of most signal streams. 

In operation, each digital signal processor is typically programmed to execute 
a respective different part of a complex processing operation. Typically, a front-end digital 
signal processor is programmed to input a stream of signal samples and to perform a first 

25 processing operation repeatedly on respective samples of the stream. The front-end digital 
signal processor passes the results of the first processing operation to a next digital signal 
processor, via a local connection. The next digital signal processor is programmed to perform 
a second processing operation and to pass its results and so on, on to a back-end digital 
processor outputs a stream of processed signal samples. 
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Obviously, the cost of this type of multi-processor system can be controlled by 
adapting the number of digital signal processor boards to the needs of an application. For a 
given application a set of programs is written for the different signal processors, the 
necessary number of digital signal processing boards is installed and the stream inputs and 
5 outputs are coupled to the external I/O interface of the processing boards that form the front- 
end and back-end of the system. 

Integration of such a signal processing system seems straightforward. One 
merely has to realize the circuits from the circuit boards in an integrated circuit and connect 
input and output pins of the integrated circuit to the external I/O interfaces of the digital 

10 signal processors that form the front-end and back-end of the system. 

However, to reduce design costs, it is desirable that, once such a type of signal 
processing integrated circuit has been designed for a specific application and prepared for 
production, the design can easily be modified to make new types of signal processing 
integrated circuits for related applications. As far as this only involves replacing the programs 

15 of the original type, such a redesign can be made at low cost. But it has been found that 

inefficiencies arise when such an integrated circuit has to be reprogrammed, when the inputs 
and outputs are connected specifically to digital signal processors that are chosen for the 
original application. A reprogrammed application can become inefficient if the front end and 
back end functions cannot be moved from specific digital signal processors. On the other 

20 hand it is also inefficient to increase signal traffic between the digital signal processors to 
transport signal data to and from the front end and back end processors when front end or 
back end processing is moved from the processors that have I/O connections for the signal 
streams. 

Among others, it is an object of the invention to provide for a signal 
25 processing integrated circuit with a plurality of interconnected digital signal processors that 
can efficiently be reprogrammed. 

Among others, it is an object of the invention to relax constraints on the 
movement of functions over different digital signal processors in an integrated circuit with a 
plurality of interconnected digital signal processors. 
30 Among others, it is an object of the invention to facilitate reduction of power 

consumption in an integrated circuit. 

According to the invention a signal processing integrated circuit is provided 
with a configurable multiplexing circuit between an IO connection and a plurality of signal 
processors ("IO connection" as used herein means a connection external to an array of digital 
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signal processors in the integrated circuit; the IO connection may connect to a peripheral 
circuit in the same integrated circuit). The configurable multiplexing circuit can be 
configured with configuration data. The multiplexing circuit receives IO control signals from 
the plurality of signal processors but gives effect only to IO control signal from signal 
5 processors that are selected by the configuration data. Thus, within a single integrated circuit 
design different digital signal processors can be configured to perform a front-end task that 
inputs a stream of data in a distributed signal processing operation, dependent on the 
distribution of the tasks over an array of digital signal processing circuits. The same goes for 
back end tasks that output a stream of data. 

10 Preferably, the IO port of at least one of the digital signal processors is 

coupled to a plurality of such configurable multiplexing circuits for different IO connections, 
at most one of the multiplexing circuits being configured to respond to IO control signals 
from that digital signal processor. Thus, the digital signal processor can be configurably 
attached to a selected IO connection. No conflicts between different digital signal processors 

15 arise at the different IO connections, because the configuration data ensures that only one 

digital signal processor has access to the IO connection. Therefore, one may omit arbitration 
between connected digital signal processors for access to the IO connection, by always 
granting any configurably selected digital signal processor unconditional access. This saves 
time and circuit area. 

20 In an embodiment the multiplexing circuit is configured to respond to selected 

IO addresses from the IO ports of selected digital signal processors. When the multiplexing 
circuit recognizes a configured IO address from a digital signal processor the multiplexing 
circuit enables access by that digital signal processor. Preferably different IO addresses are 
configurably associated with different control signal values for a peripheral circuit that is 

25 coupled to the IO connection. Thus, no peripheral specific connections are needed at the 

digital signal processors, making it easy to use different ones of the digital signal processors 
as front or back end. 

Preferably, the configuration data is set under control of programs executed by 
the digital signal processing circuits themselves. This makes configuration self-contained. In 

30 addition it makes it possible to perform dynamic switching between signal processing tasks 
under program control, wherein the execution of front end and/or back end tasks is shifted 
from one digital signal processor to another. 

In an embodiment, the digital signal processors of a given integrated circuit 
are permanently programmed, using a ROM program memory for example, and the 
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multiplexing circuit is likewise permanently configured to couple an IO connection 
permanently to a single digital signal processor. In this case the fact that the connection is 
programmable has no function in a given integrated circuit, but merely makes it possible to 
manufacture different integrated circuits with different programs, in which the IO 
5 connections are coupled to other digital signal processors with a minimum of design changes. 



These and other objects and advantageous effects of the invention will be described by means 
of examples illustrated in the figures. 
10 Figure 1 shows an integrated signal processing circuit. 

Figure 2 shows an architecture of a digital signal processor 

Figure 3 shows a multiplexing circuit 

Figure 4 shows a control signal interface in a multiplexing circuit 
Figure 5 shows a data signal interface in a multiplexing circuit 
15 Figure 6 shows an embodiment of a translation circuit 



Figure 1 schematically shows an integrated signal processing circuit. The 
integrated signal processing circuit contains an array of instruction based digital signal 

20 processors 10, an external input 1 la, an external output lib, multiplexing circuits 12 and 
peripheral circuits 14. Digital processors 10 have neighbor interconnections to their nearest 
neighbors (although a 2x2 matrix of digital signal processors 10 is shown with two nearest 
neighbor connections for each digital signal processor 10, it will be appreciated that 
differently sized matrices may be used, with different numbers of neighbor connections, for 

25 example only to nearest neighbors, or to nearest neighbors and next nearest neighbors etc.). 

External input 1 la is coupled to a first terminal of a first multiplexing circuit 
12 via a peripheral circuit 14. First multiplexing circuit 12 has second terminals coupled to 
each of digital signal processors 10. External output 1 lb is coupled to a first terminal of a 
second multiplexing circuit 12 via a peripheral circuit 14. Second multiplexing circuit 12 has 

30 second terminals coupled to each of digital signal processors 10. Although single lines are 
shown, it should be understood that each terminal, in fact, may contain a plurality of 
connections in parallel, such as a number of data connections for respective data bits, a 
number of address connections for respective address bits and control connections such as a 
read/write and/or an enable connection. 
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Figure 2 shows an example of an architecture of a digital signal processor 10. 
In this architecture digital signal processor 10 contains an instruction processing core 20, a 
read only program memory 22, a register file 24, communication ports 25 for coupling to a 
neighboring digital signal processors (not shown) in the matrix, an IO port 26 for coupling to 
5 multiplexing circuits 12 (not shown) and a data memory 28 (additionally a separate 

coefficient memory (not shown) may be provided when using a double Harvard architecture) 
. Although IO port 26 is shown as a single line, it should be understood that each may 
represent a plurality of signal conductors, e.g. six IO address conductors and thirty-two data 
conductors and control conductors. Communication ports 25 similarly may comprise many 

10 conductors. Instruction processing core 20 has an instruction fetching input coupled to 

program memory 22, operand read and write inputs and outputs coupled to ports 25, 26 and a 
data read/write interface coupled to data memory 28. 

In operation, when the integrated circuit is started up, signal processors 10 
write control data to multiplexing circuits 12. The control data values for respective ones of 

1 5 the multiplexing circuits are determined by read only data from program memories 22 of 
digital signal processors. The control data controls each multiplexing circuit 12 to establish 
selective connections between digital signal processors 10 and the relevant external 
connection 1 la,b, via a peripheral circuit 14. 

Subsequently, digital signal processors execute signal processing instructions 

20 from their program memories 22, to perform signal processing operations. The program for a 
first and second single one of the digital signal processors 10 also includes instructions to 
read and write signal data from and to the ports coupled to the first and second multiplexing 
circuit 12 respectively. Typically the circuit is arranged so that these instructions are 
dedicated IO read and write instructions with an instruction code specific for reading or 

25 writing from an IO port such as the one connected to multiplexing circuit 12, but instead the 
circuit may be arranged so that register read instructions with a register address 
corresponding to the multiplexer port can be used to access an IO port, or memory mapped 
read and write instructions that can also be used to address data memory 28 may be used to 
access an IO port, when appropriate addresses are used. 

30 Digital signal processors 10 also communicate signal data via the nearest 

neighbor connections, so that digital signal processors that do not read or write directly to 
multiplexing circuits 12 can also perform signal processing operations using intermediate 
signal values computed from signal values at the external input 11a and affect signal values 
at the external output lib. 
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Figure 3 shows an embodiment of a multiplexing circuit 12, containing a 
connection circuit 30, translation circuits 34, and an update circuit 38. Each translation circuit 
34 has inputs coupled to the address lines and control lines of a respective one of digital 
signal processors 10 (not shown) and an output coupled to connection circuit 30. Data lines 
5 of a respective one of digital signal processors 10 (not shown) are coupled directly to 

connection circuit 30. Connection circuit 30 has IO control outputs and data inputs/outputs 
(32 bits per input for example) coupled to peripheral circuit 14. Connection circuit 30 is 
arranged to pass data between a selectable one of the digital signal processing circuits and 
peripheral circuit 14. Connection circuit 30 is also arranged to pass control data from the 

10 translation circuits 34 to peripheral circuit 14. 

It will be appreciated that, although single data and control lines are shown 
connected to peripheral circuit 14, in fact more than one connection may be provided in 
parallel. The number of control lines to peripheral circuit 14 may even be different for 
different peripheral circuits, dependent on the type of peripheral circuit 14. 

15 In operation, digital signal processors 10 initially write configuration data to 

translation circuits 34. Each translation circuit 34 corresponds to a respective combination of 
a digital signal processor 10 and a peripheral circuit 14. The configuration data for each 
particular translation circuit 34 specifies whether IO control signals and/or data from the 
corresponding digital signal processor 10 should be exchanged with the corresponding 

20 peripheral circuit 14 or not. If so, the configuration data for the particular translation circuit 
34 specifies which one or more IO addresses that, when supplied by the corresponding 
processor 10 on its IO port 26, the particular translation circuit 34 should respond to. 
Preferably, the configuration data also specifies the control signal or signals that the 
particular translation circuit 34 must supply to the corresponding peripheral circuit 14 when 

25 the corresponding digital signal processor 10 supplies respective addresses. 

For example, one IO address may be specified only, so that the particular 
translation circuit 34 passes data from IO port 26 only if the corresponding processor 10 
supplies that IO address. In another example, several IO addresses may be specified, each in 
combination with a respective set of control signals for the corresponding peripheral circuit 

30 14, e.g. one address for supplying data from a status register in peripheral circuit 14 to IO 

port 26 and another address for supplying signal data from a signal data register in peripheral 
circuit 14 to IO port 26. 

Typically, each digital processor 10 writes configuration data so that the 
translation circuit 34 for at most one of the peripheral circuits 14 will respond to one or more 
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IO addresses from the digital signal processor 10, and different digital signal processors 10 
typically write configuration data so that the translation circuit 34 for different peripheral 
circuits 14 will respond to one or more IO addresses from different digital signal processor 
10. Thus, each peripheral 14 is accessed by a respective one of digital signal processors 10. 
5 However, in an embodiment, the configuration data allows multiple digital signal processors 
10 to access the same peripheral 14, each using its own defined IO addresses. In this or 
another embodiment, the configuration data allows a digital signal processor 10 to access 
multiple peripherals 14. Typically, in this case, the digital signal processor 10 sets the 
configuration data so that translation circuits 34 for the different peripherals 14 will respond 
10 to different addresses. Thus, the digital signal processor is able to address different 
peripherals by issuing different IO addresses. However, in an embodiment the same 
addresses may be configured, e.g. combinations of data from different peripherals must be 
read. 

In the embodiment shown digital signal processors 10 supply the configuration 

15 data via the data lines of the IO ports. Update circuit 38 detects whether a specific IO address 
associated with a configuration update is supplied and if so update circuit 38 makes 
translation circuit 34 update the configuration data with data from the IO port 26 of the 
digital signal processor 10. For example one predetermined IO address from IO port 26 may 
be used to signal that the accompanying data from IO port 26 represents one or more IO 

20 addresses for which configuration data is to be stored. Optionally another predetermined IO 
address may be used to signal that the accompanying data from the IO port 26 represents 
configuration data to be stored in association with a previously specified address. 

After writing the configuration data signal processors 10 start executing signal 
processing programs from program memory 22. During execution of the signal processing 

25 programs, digital signal processors 10 execute IO instructions, which result in the application 
of instruction dependent IO addresses on IO port 26. The translation circuits 34 that are 
coupled to the IO port detect whether the IO addresses match configured addresses for the 
translation circuits 34. If the IO address matches for a particular translation circuit 34, the 
translation circuit 34 controls connection circuit 30 to pass data between the digital signal 

30 processor 10 and the peripheral that correspond to the translation circuit and/or to pass 
control signals that have been defined for the IO address by the configuration data. 

For example, in response to a configured address a translation circuit 34 
supplies read control data to the peripheral circuit 14 via connection circuit 30 and controls 
connection circuit 30 to pass read data from peripheral 14 to IO port 26. In another example, 
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in response to a configured address a translation circuit 34 supplies write control data to the 
peripheral circuit 14 via connection circuit 30 and controls connection circuit 30 to pass write 
data from IO port 26 to peripheral 14. 

Figure 4 shows an embodiment of a control part of connection circuit 30. This 
5 embodiment contains a plurality of OR gates 40, each with a respective set of inputs 44a-c. 
Three OR gates 40 are shown by way of example, but it should be understood that the 
number of OR gates depends on the number of control connections of peripheral circuit 14. 
The inputs 44a-c of each OR gate 40 are coupled to outputs of respective ones of the 
translation circuits 34. The outputs of the OR gates 40 are coupled to the peripheral circuit 

10 14. Thus, in operation, the peripheral circuit 14 receives the logic OR of output signals from 
translation circuits 34 that correspond to different digital signal processors 10. In this 
embodiment translation circuits 34 are arranged so that they supply logic zeros when they do 
not receive the configured addresses. 

A noteworthy point is that no arbitration is performed. Conflicts due to 

15 simultaneous IO addresses on IO ports 26 of the different digital signal processors 10 are 
avoided by the configuration of the multiplexing circuit 12, which responds only to 
configured ones of the digital signal processors. As a result, a normal fast logic circuit can be 
used between the IO ports and the peripheral circuit 14, without tri -state drivers or arbiters 
that suffer from metastable conditions. Access is guaranteed within one or a predetermined 

20 number of cycles to ensure real-time behaviour and system predictability. 

Figure 5 shows an embodiment of a data part of a connection circuit. This 
embodiment contains N to 1 switches 42 for respective bits of the IO data (N being the 
number of digital signal processors 10), with first side connections 48a-d of each switch 42 
coupled to the IO ports of respective ones of the signal processors 10. Second side 

25 connections of each switch 42 are coupled to the peripheral circuit 14. Switches 42 are 

controlled via control lines 46 from translation circuits 34 which control whether switches 42 
connect the data lines of the digital signal processor 10 that correspond to translation circuit 
34 to peripheral circuit 14. In operation, translation circuits 14 selectively activate the control 
lines dependent on whether IO addresses from digital signal processors 10 match configured 

30 addresses. 

In an embodiment switches 42 are implemented by two-way switching 
elements (not shown). In another embodiment tri-state drivers may be used, which drive 
signals from the peripheral circuit 14 to the IO ports 26 of the digital signal processors 10 or 
vice versa, dependent on whether a read or write control signal is received from the IO port, 
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if the IO address on the IO port 26 matches a configured address. In general, no arbitration 
circuit is provided to exclude that more than one of the digital signal processors 10 writes to 
the same peripheral circuit 14 at the same time, or that more than one peripheral circuit 14 is 
connected to the same signal processor 10 at the same time: the signal processors operate in 
5 lock-step and are programmed so that such conflicts do not occur, or if they occur, that no 
damaging effect will result. 

In yet another embodiment separate data read and write line may be provided 
in IO port 26. In this embodiment logic circuits may be used instead of switches to form write 
signals to the peripheral circuits and read signals to the IO ports 26 of the digital signal 
10 processors 10. The write signals for a peripheral circuit 14 are formed for example as the 
logic OR of enabled data from different digital signal processors 10, enablement being 
controlled by translation circuits 14. The read data may be formed as the logic OR of enabled 
data from different peripheral circuits 14, enablement being controlled by translation circuits 
14. 

15 Each translation circuit 14 may be realized as a memory for example, with 

memory locations for different addresses. In this embodiment the IO address from digital 
signal processor 10 is used to address the memory and data from the memory is supplied to 
connection circuit 30 as control data. During configuration appropriate control data is stored 
in memory locations with addresses selected by digital signal processors 1 0. 

20 Figure 6 shows another embodiment of a translation circuit. This embodiment 

contains address registers 60, address comparators 62 and a control signal generating circuit 
64. Address comparators 62 have first inputs coupled to the address input from IO port 26 of 
a corresponding digital signal processor 10 and second inputs coupled to outputs of address 
registers 60. Address comparators 62 have outputs coupled to control signal generating 

25 circuit 64. Address registers 60 have inputs coupled to the data lines of the IO port 26 of the 
corresponding digital signal processor 10 and clocking inputs coupled to an output of update 
circuit 38. Although two comparator-register pairs are shown, it should be understood that 
any number (e.g. one, two, three or more) may be used. 

In operation, address registers 60 latch data from the data lines of IO port 26 

30 of the corresponding digital signal processor 10 when the digital signal processor 10 outputs 
a predetermined address on its IO port 26. Typically, different fields from the data are latched 
in different registers. As an example, four addresses from different six bit address fields from 
a 32 bit data word may be latched in address registers 60. During normal operation 
comparators 62 compare the latched addresses with addresses supplied from the IO port 26 of 
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the corresponding digital signal processor 10. If a match occurs a signal is supplied to control 
signal generating circuit 64, which generates one or more control signal for peripheral circuit 
14 and/or connection circuit 30 in response. In an embodiment, control signal generating 
circuit 64 defines predetermined control signals for respective ones of its inputs and generate 
5 the predetermined control signal when a signal is received from a comparator at a specific 
input. In another embodiment the control signals may be programmable as well. The use of 
comparators instead of an addressable memory has the advantage that faster translation is 
possible, without requiring additional clock cycles. 

Although the invention has been described using a specific embodiment it will 

10 be understood that many variations are possible without deviating from the invention. For 
example, although an array of digital signal processors 10 was shown wherein all digital 
signal processors are coupled to all multiplexing circuits 12 for respective peripheral circuits 
14, it will be understood that in practice, especially if large arrays of digital signal processors 
10 are used, only a subset of a plurality of digital signal processors 10 may be coupled to a 

15 multiplexing circuit 12. For example, if the digital signal processors 10 have mutual neighbor 
connections according to a matrix of rows and columns, a single column of digital signal 
processors 10 may be coupled to a multiplexing circuit only. This maintains flexibility in the 
location of signal processing programs, but saves on circuitry e.g. for translation circuits 
It will be understood that any number of digital signal processors 10 may be used, connected 

20 in any possible topology. 

As another example, instead of programming the configuration data from the 
digital signal processors, a common control processor may be used to program the 
configuration data. A micro-controller core may be added to the integrated circuit for this 
purpose, and arranged to execute a program that writes the configuration data to the 

25 multiplexing circuits 12. This has the advantage that the programs of digital signal processors 
10 do not need to contain any information about configuration. However, it requires 
additional connections to the micro-controller, and it means that changes to the programs of 
the digital signal processors 10 that involve movement of input or output tasks also involve 
changes of the program of the micro-controller. In another embodiment, the configuration 

30 data is permanently programmed into the multiplexing circuits 12 themselves, e.g. during 
manufacture. 

Although the invention has been described for one-time configuration before 
the start of signal processing, it should be understood that dynamic reconfiguration may be 
provided for in some applications. In this case, after performing a first signal processing 
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operation on a signal stream for some time interval, a switch is made to a second signal 
processing operation, for example in response to a user command. Programs for performing 
both the first and second signal processing operation may be provided in the digital signal 
processors 10. It may be the case that the distribution of signal processing tasks over the 
5 different signal processing circuit is different for the different operations, so that the signal 
streams should enter and/or leave the array of digital signal processors 10 at different points 
in case of the first and second signal processing operation. If so, the digital signal processors 
1 0, or a common controller, preferably write new configuration data to the multiplexing 
circuit before the digital signal processors 10 start to execute the second signal processing 
10 operation. 

Preferably, the configuration data is arranged so that IO addresses from an IO 
port 26 will result into a connection to a single peripheral circuit 14 at a time (by configuring 
only one peripheral circuit to respond addresses, or by using different addresses for different 
peripheral circuits 14). However, without deviating from the invention it may be possible to 

15 address more than one peripheral circuit 14 with the same address. In this way for example, 
data may be written to more than peripheral circuit 14 at a time or, if the peripheral circuits 
14 drive different subsets of the data lines of the IO port 26, a combination of data from more 
than one peripheral circuit 14 may be read. 

Furthermore, it will be understood that the invention is not limited to 

20 implementation details of the examples. For example, some or all digital signal processors 10 
may have more than one IO port, in which case part of the IO ports may be connected to part 
of the multiplexing circuits and other IO ports may be connected to other multiplexing 
circuits. Any address and data word width and interface type may be used for the IO ports. 

Any type of peripheral circuit 14 may be used with its own dedicated interface 

25 type, to produce or consume a series of signal values, where the series in principle may 

continue for any indefinite time interval. For example an audio input peripheral may be used, 
which streams left and right channel sound sample values. In this case, different IO addresses 
may correspond to reading of left and right channel data and repeated reading with the same 
control signals is used to read sample values for successive sampling time points. In another 

30 example an audio output peripheral circuit may be used, or a video signal input/output, or the 
peripheral may be a dedicated processing circuit, such as a DCT transform circuit, an error 
correcting encoder or decoder etc., wherein repeated reading or writing with the same control 
signals is used to read sample values for successive sampling time points or image locations 
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for example. Each may require its own form of control signals, which are all generated by the 
digital signal processors 10 by issuing respective addresses at the IO ports 26. 



